Application of Classification Algorithms in Data Mining for Hotspots Occurrence Prediction in Riau Province Indonesia

نویسندگان

  • IMAS SUKAESIH SITANGGANG
  • RAZALI YAAKOB
  • NORWATI MUSTAPHA
چکیده

High fire occurrence in Riau Province, Indonesia has been going on in the recent years with large areas occurring in the peat soil. In this paper a data mining technique namely classification was applied on forest fire data to develop classification models for hotspots occurrence in Riau Province. The models provide characteristics of areas where active fires occurred. We studied physical data including land cover, road, river, city centers, industrial timber plantation, logging concession, peatland depth and peatland types to classify 2693 target objects. Target objects are true alarm data namely hotspots distribution in 2008 and false alarm data which are randomly generated within the areas at least 1 km away from any true alarm data. We applied three classification algorithms that are available in the data mining toolkit Weka 3.6.2: J48 module as Java implementation of C4.5 algorithm, SimpleCart and NaïveBayes. The result shows that the classifier generated from the J48 has highest accuracy i.e. 69.59 % compared to two other algorithms. Our results based on the J48 classifier show that hotspots are predicted to take place in areas that (1) are non logging concession areas, (2) are plantation and dryland forest, and (3) have peatland type: Very deep Hemists/Saprists (> 400 cm). Additionally, hotspot occurrence probability is higher in areas located 10 km from roads, 3 km from rivers and within 5 km to 20 km of city centers where the areas are accessible to humans.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification Model for Hotspot Occurrences Using Spatial Decision Tree Algorithm

Developing a predictive model for forest fires occurrence is an important activity in a fire prevention program. The model describes characteristics of areas where fires occur based on past fires data. It is essential as an early warning system for preventing forest fires, thus major damages because of fires can be avoided. This study describes the application of data mining technique namely de...

متن کامل

Spatial Multidimensional Association Rules Mining in Forest Fire Data

Hotspots (active fires) indicate spatial distribution of fires. A study on determining influence factors for hotspot occurrence is essential so that fire events can be predicted based on characteristics of a certain area. This study discovers the possible influence factors on the occurrence of fire events using the association rule algorithm namely Apriori in the study area of Rokan Hilir Riau ...

متن کامل

Application of Data-Mining Algorithms in the Sensitivity Analysis and Zoning of Areas Prone to Gully Erosion in the Indicator Watersheds of Khorasan Razavi Province

Extended abstract 1- Introduction Gully erosion is one of the most important sources of sediment in the watersheds and a common phenomenon in semi-arid climate that affects vast areas with different morphological, soil and climatic conditions. This type of erosion is very dangerous due to the transfer of fertile soil horizons, and the reduction of water holding capacity also is a factor for s...

متن کامل

Analysis of Pre-processing and Post-processing Methods and Using Data Mining to Diagnose Heart Diseases

Today, a great deal of data is generated in the medical field. Acquiring useful knowledge from this raw data requires data processing and detection of meaningful patterns and this objective can be achieved through data mining. Using data mining to diagnose and prognose heart diseases has become one of the areas of interest for researchers in recent years. In this study, the literature on the ap...

متن کامل

S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization

Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012